
Sequence Listing 

Sequence Number : 1 
Sequence Length : 2578 
Type of Sequence : Nucleic acid 
Strandedness : Single 
Topology : Linear 
Molecule Type : Genomic DNA 
Original Source 

Organism : Sulfolobus solfataricus 
Strain : KM1 
Sequence 

GCATGCCATT AAAAGATGTA ACaWaCA CTCCAGACGG TAAGGAGGTT GATGAGAAAG 60
CATGGAATTC CCCAACGCAA ACTGTtW TCGTGTTAGA GGGGAGCGTA ATGGATGAGA 120
TTAACATCTA TGGAGAGAGA ATTGCGGAtKaTTCATTCTT GATAATTCTT AACGCAAATC 180
CCAATAACGT AAAAGTGAAG TTCCCAAAGG GT^AATGGGA ACTAGTTGTT GGTTCTTATT 240
TGAGAGAGAT AAAACCAGAA GAAAGAATTG TAGAAUTGA GAAGGAATTG GAAATTGAGG 300
GAAGAACAGC ATTAGTTTAT AGGAGGACAG AACT ATG ATA ATA GGC ACA TAT AGG 355
Met Ile Ile Gly Thr Tyr Arg
1 5

CTG CA^ CTC AAT AAG AAA TTC ACT TTT TAC GAT ATA ATA GAA AAT TTG 403
Leu Gln Leu Asn Lys Lys Phe Thr Phe Tyr Asp Ile Ile Glu Asn Leu
10 15 20






GAT TAT TTT AAA GAA TTA GGA GTA TCA CAC CTA TAT CTA TCT CCA ATA
Asp Tyr Phe Lys Glu Leu Gly Val Ser His Leu Tyr Leu Ser Pro Ile
25 30 35

CTT AAG GCT AGA CCA GGG AGC ACT CAC GGC TAC GAT GTA GTA GAT CAT
Leu Lys Ala Arg Pro Gly Ser Thr His Gly Tyr Asp Val Val Asp His
40 45 50 55

ACT GAA ATT AAT GAG GAA TTA GGA GGA GAA GAG GGG TGC TTT AAA CTA
Ser Glu Ile Asn Glu Glu Leu Gly Gly Glu Glu Gly Cys Phe Lys Leu
60 65 70

GTT AAG GAA GCT AAG AGT AGA GGT TTA GAA ATC ATA CAA GAT ATA GTG
Val Lys Glu Ala Lys Ser Arg Gly Leu Glu Ile Ile Gln Asp Ile Val
75 80 85

CCA AAT CAC ATG GCG GTA CAT CAT ACT AAT TGG AGA CTT ATG GAT CTG
Pro Asn His Met Ala Val His His Thr Asn Trp Arg Leu Met Asp Leu
90 95 100

TTA AAG AGT TGG AAG AAT AGT AAA TAC TAT AAC TAT TTT GAT CAC TAC
Leu Lys Ser Trp Lys Asn Ser Lys Tyr Tyr Asn Tyr Phe Asp His Tyr
105 110 115

GAT GAT GAC AAG ATA ATC CTC CCA ATA CTT GAG GAC GAG TTG GAT ACC
Asp Asp Asp Lys Ile Ile Leu Pro Ile Leu Glu Asp Glu Leu Asp Thr
120 125 130 135

GTT ATA GAT AAG GGA TTG ATA AAA CTA CAG AAG GAT AAT ATA GAG TAC
Val Ile Asp Lys Gly Leu Ile Lys Leu Gln Lys Asp Asn Ile Glu Tyr
140 145 150

AGA GGG CTT ATA TTA CCT ATA AAT GAT GAA GGA GTT GAA TTC TTG AAA 835
Arg Gly Leu Ile Leu Pro Ile Asn Asp Glu Gly Val Glu Phe Leu Lys
155 160 165

AGG ATT AAT TGC TTT GAT AAT TCA TGT TTA AAG AAA GAG GAT ATA AAG 883
Arg Ile Asn Cys Phe Asp Asn Ser Cys Leu Lys Lys Glu Asp Ile Lys
170 175 180

AAA TTA CTA TTA ATA CAA TAT TAT CAG CTA ACT TAC TGG AAG AAA GGT 931
Lys Leu Leu Leu Ile Gln Tyr Tyr Gln Leu Thr Tyr Trp Lys Lys Gly
185 190 195

TAT CCA AAC TAT AGG AGA TTT TTC GCA GTA AAT GAT TTG ATA GCT GTT 979
Tyr Pro Asn Tyr Arg Arg Phe Phe Ala Val Asn Asp Leu Ile Ala Val
200 205 210 215

AGG GTA GAA TTG GAT GAA GTA TTT AGA GAG TCC CAT GAG ATA ATT GCT 1027
Arg Val Glu Leu Asp Glu Val Phe Arg Glu Ser His Glu Ile Ile Ala
220 225 230

AAG CTA CCA GTT GAC GGT TTA AGA ATT GAC CAC ATA GAT GGA CTA TAT 1075
Lys Leu Pro Val Asp Gly Leu Arg Ile Asp His Ile Asp Gly Leu Tyr
235 240 245

AAC CCT AAG GAG TAT TTA GAT AAG CTA AGA CAG TTA GTA GGA AAT GAT 1123
Asn Pro Lys Glu Tyr Leu Asp Lys Leu Arg Gln Leu Val Gly Asn Asp
250 255 260

AAG ATA ATA TAC GTA GAG AAG ATA TTG TCA ATC AAC GAG AAA TTA AGA 1171
Lys Ile Ile Tyr Val Glu Lys Ile Leu Ser Ile Asn Glu Lys Leu Arg
265 270 275



GAT GAT TGG AAA GTA GAT GGG ACT ACT GGA TAT GAT TTC TTG AAC TAC 1219
Asp Asp Trp Lys Val Asp Gly Thr Thr Gly Tyr Asp Phe Leu Asn Tyr
280 285 290 295

GTT AAT ATG CTA TTA GTA GAT GGA AGT GGT GAG GAG GAG TTA ACT AAG 1267
Val Asn Met Leu Leu Val Asp Gly Ser Gly Glu Glu Glu Leu Thr Lys
300 305 310

TTT TAT GAG AAT TTC ATT GGA AGG AAA ATC AAT ATA GAC GAG TTA ATA 1315
Phe Tyr Glu Asn Phe Ile Gly Arg Lys Ile Asn Ile Asp Glu Leu Ile
315 320 325

ATA CAA AGT AAA AAA TTA GTT GCA AAT CAG TTA TTT AAA GGT GAC ATT 1363
Ile Gln Ser Lys Lys Leu Val Ala Asn Gln Leu Phe Lys Gly Asp Ile
330 335 340

GAA AGA TTA AGC AAG TTA CTG AAC GTT AAT TAC GAT TAT TTA GTA GAT 1411
Glu Arg Leu Ser Lys Leu Leu Asn Val Asn Tyr Asp Tyr Leu Val Asp
345 350 355

TTT CTA GCA TGT ATG AAA AAA TAC AGG ACT TAT TTA CCA TAT GAG GAT 1459
Phe Leu Ala Cys Met Lys Lys Tyr Arg Thr Tyr Leu Pro Tyr Glu Asp
360 365 370 375

ATT AAC GGA ATA AGG GAA TGC GAT AAG GAG GGA AAG TTA AAA GAT GAA 1507
Ile Asn Gly Ile Arg Glu Cys Asp Lys Glu Gly Lys Leu Lys Asp Glu
380 385 390

AAG GGA ATC ATG AGA CTC CAA CAA TAC ATG CCA GCA ATC TTC GCT AAG 1555
Lys Gly Ile Met Arg Leu Gln Gln Tyr Met Pro Ala Ile Phe Ala Lys
395 400 405



GGC TAT GAG GAT ACT ACC CTC TTC ATC TAC AAT AGA TTA ATT TCC CTT 1603
Gly Tyr Glu Asp Thr Thr Leu Phe Ile Tyr Asn Arg Leu Ile Ser Leu
410 415 420

AAC GAG GTT GGG AGC GAC CTA AGA AGA TTC AGT TTA AGC ATC AAA GAC 1651
Asn Glu Val Gly Ser Asp Leu Arg Arg Phe Ser Leu Ser Ile Lys Asp
425 430 435

TTT CAT AAC TTT AAC CTA AGC AGA GTA AAT ACC ATA TCA ATG AAC ACT 1699
Phe His Asn Phe Asn Leu Ser Arg Val Asn Thr Ile Ser Met Asn Thr
440 445 450 455

CTT TCC ACT CAT GAT ACT AAA TTC ACT GAA GAC GTT AGA GCT AGA ATA 1747
Leu Ser Thr His Asp Thr Lys Phe Ser Glu Asp Val Arg Ala Arg Ile
460 465 470

TCA GTA CTA TCT GAG ATA CCA AAG GAG TGG GAG GAG AGG GTA ATA TAC 1795
Ser Val Leu Ser Glu Ile Pro Lys Glu Trp Glu Glu Arg Val Ile Tyr
475 480 485

TGG CAT GAT TTG TTA AGG CCA AAT ATT GAT AAA AAC GAT GAG TAT AGA 1843
Trp His Asp Leu Leu Arg Pro Asn Ile Asp Lys Asn Asp Glu Tyr Arg
490 495 500

TTT TAT CAA ACA CTT GTG GGA ACT TAC GAG GGA TTT GAT AAT AAG GAG 1891
Phe Tyr Gln Thr Leu Val Gly Ser Tyr Glu Gly Phe Asp Asn Lys Glu
505 510 515

AGA ATT AAG AAC CAC ATG ATT AAG GTC ATA AGA GAA GCT AAG GTA CAT 1939
Arg Ile Lys Asn His Met Ile Lys Val Ile Arg Glu Ala Lys Val His
520 525 530 535

ACA ACG TGG GAA AAT CCT AAT ATA GAG TAT GAA AAG AAG GTT CTG GGT 1987
Thr Thr Trp Glu Asn Pro Asn Ile Glu Tyr Glu Lys Lys Val Leu Gly
540 545 550

TTC ATA GAT GAA GTG TTC GAG AAC ACT AAT TTT AGA AAT GAT TTT GAA 2035
Phe Ile Asp Glu Val Phe Glu Asn Ser Asn Phe Arg Asn Asp Phe Glu
555 560 565

AAT TTT GAA AAG AAA ATA GTT TAT TTC GGT TAT ATG AAA TCA TTA ATC 2083
Asn Phe Glu Lys Lys Ile Val Tyr Phe Gly Tyr Met Lys Ser Leu Ile
570 575 580

GCA ACG ACA CTT AGG TTC CTT TCG CCC GGT GTA CCA GAT ATT TAT CAA 2131
Ala Thr Thr Leu Arg Phe Leu Ser Pro Gly Val Pro Asp Ile Tyr Gln
585 590 595

GGA ACT GAA GTT TGG AGA TTC TTA CTT ACA GAC CCA GAT AAC AGA ATG 2179
Gly Thr Glu Val Trp Arg Phe Leu Leu Thr Asp Pro Asp Asn Arg Met
600 605 610 615

CCG GTG GAT TTC AAG AAA CTA AAG GAA TTA TTA AAT AAT TTG ACT GAA 2227
Pro Val Asp Phe Lys Lys Leu Lys Glu Leu Leu Asn Asn Leu Thr Glu
620 625 630

AAG AAC TTA GAA CTC TCA GAT CCA AGA GTC AAA ATG TTA TAT GTT AAG 2275
Lys Asn Leu Glu Leu Ser Asp Pro Arg Val Lys Met Leu Tyr Val Lys
635 640 645

AAA TTG CTA CAG CTT AGA AGA GAG TAC TCA CTA AAC GAT TAT AAA CCA 2323
Lys Leu Leu Gln Leu Arg Arg Glu Tyr Ser Leu Asn Asp Tyr Lys Pro
650 655 660



TTG CCC TTT GGC TTC CAA AGG GGA AAA GTA GCT GTC CTT TTC TCA CCA 2371
Leu Pro Phe Gly Phe Gln Arg Gly Lys Val Ala Val Leu Phe Ser Pro
665 670 675

ATA GTG ACT AGG GAG GTT AAA GAG AAA ATT AGT ATA AGG CAA AAA AGC 2419
Ile Val Thr Arg Glu Val Lys Glu Lys Ile Ser Ile Arg Gln Lys Ser
680 685 690 695

GTT GAT TGG ATC AGA AAT GAG GAA ATT AGT AGT GGA GAA TAC AAT TTA 2467
Val Asp Trp Ile Arg Asn Glu Glu Ile Ser Ser Gly Glu Tyr Asn Leu
700 705 710

AGT GAG TTG ATT GGG AAG CAT AAA GTC GTT ATA TTA ACT GAA AAA AGG 2515
Ser Glu Leu Ile Gly Lys His Lys Val Val Ile Leu Thr Glu Lys Arg
715 720 725

GAG TGAACTACCT ACATAGATTT ATTCTTGAAC TACTCTGGTC AGAAATGTAT 2568
Glu

TACGCAGATC 2578

Sequence Number : 2 
Sequence Length : 728 
Type of Sequence : Amino acid 
Strandedness : Single 
Topology : Linear 
Molecule Type : Protein 
Original Source 

Organism : Sulfolobus solfataricus 




Strain : KM1 
Sequence 

Met Ile Ile Gly Thr Tyr Arg Leu Gln Leu Asn Lys Lys Phe Thr Phe
1 5 10 15

Tyr Asp Ile Ile Glu Asn Leu Asp Tyr Phe Lys Glu Leu Gly Val Ser
20 25 30

His Leu Tyr Leu Ser Pro Ile Leu Lys Ala Arg Pro Gly Ser Thr His
35 40 45

Gly Tyr Asp Val Val Asp His Ser Glu Ile Asn Glu Glu Leu Gly Gly
50 55 60

Glu Glu Gly Cys Phe Lys Leu Val Lys Glu Ala Lys Ser Arg Gly Leu
65 70 75 80

Glu Ile Ile Gln Asp Ile Val Pro Asn His Met Ala Val His His Thr
85 90 95

Asn Trp Arg Leu Met Asp Leu Leu Lys Ser Trp Lys Asn Ser Lys Tyr
100 105 110

Tyr Asn Tyr Phe Asp His Tyr Asp Asp Asp Lys Ile Ile Leu Pro Ile
115 120 125

Leu Glu Asp Glu Leu Asp Thr Val Ile Asp Lys Gly Leu Ile Lys Leu
130 135 140

Gln Lys Asp Asn Ile Glu Tyr Arg Gly Leu Ile Leu Pro Ile Asn Asp
145 150 155 160

Glu Gly Val Glu Phe Leu Lys Arg Ile Asn Cys Phe Asp Asn Ser Cys
165 170 175

Leu Lys Lys Glu Asp Ile Lys Lys Leu Leu Leu Ile Gln Tyr Tyr Gln
180 185 190

Leu Thr Tyr Trp Lys Lys Gly Tyr Pro Asn Tyr Arg Arg Phe Phe Ala
195 200 205

Val Asn Asp Leu Ile Ala Val Arg Val Glu Leu Asp Glu Val Phe Arg
210 215 220

Glu Ser His Glu Ile Ile Ala Lys Leu Pro Val Asp Gly Leu Arg Ile
225 230 235 240

Asp His Ile Asp Gly Leu Tyr Asn Pro Lys Glu Tyr Leu Asp Lys Leu
245 250 255

Arg Gln Leu Val Gly Asn Asp Lys Ile Ile Tyr Val Glu Lys Ile Leu
260 265 270

Ser Ile Asn Glu Lys Leu Arg Asp Asp Trp Lys Val Asp Gly Thr Thr
275 280 285

Gly Tyr Asp Phe Leu Asn Tyr Val Asn Met Leu Leu Val Asp Gly Ser
290 295 300

Gly Glu Glu Glu Leu Thr Lys Phe Tyr Glu Asn Phe Ile Gly Arg Lys
305 310 315 320

Ile Asn Ile Asp Glu Leu Ile Ile Gln Ser Lys Lys Leu Val Ala Asn
325 330 335

Gln Leu Phe Lys Gly Asp Ile Glu Arg Leu Ser Lys Leu Leu Asn Val
340 345 350

Asn Tyr Asp Tyr Leu Val Asp Phe Leu Ala Cys Met Lys Lys Tyr Arg
355 360 365

Thr Tyr Leu Pro Tyr Glu Asp Ile Asn Gly Ile Arg Glu Cys Asp Lys
370 375 380

Glu Gly Lys Leu Lys Asp Glu Lys Gly Ile Met Arg Leu Gln Gln Tyr
385 390 395 400

Met Pro Ala Ile Phe Ala Lys Gly Tyr Glu Asp Thr Thr Leu Phe Ile
405 410 415

Tyr Asn Arg Leu Ile Ser Leu Asn Glu Val Gly Ser Asp Leu Arg Arg
420 425 430

Phe Ser Leu Ser Ile Lys Asp Phe His Asn Phe Asn Leu Ser Arg Val
435 440 445

Asn Thr Ile Ser Met Asn Thr Leu Ser Thr His Asp Thr Lys Phe Ser
450 455 460

Glu Asp Val Arg Ala Arg Ile Ser Val Leu Ser Glu Ile Pro Lys Glu
465 470 475 480

Trp Glu Glu Arg Val Ile Tyr Trp His Asp Leu Leu Arg Pro Asn Ile
485 490 495

Asp Lys Asn Asp Glu Tyr Arg Phe Tyr Gln Thr Leu Val Gly Ser Tyr
500 505 510

Glu Gly Phe Asp Asn Lys Glu Arg Ile Lys Asn His Met Ile Lys Val
515 520 525

Ile Arg Glu Ala Lys Val His Thr Thr Trp Glu Asn Pro Asn Ile Glu
530 535 540

Tyr Glu Lys Lys Val Leu Gly Phe Ile Asp Glu Val Phe Glu Asn Ser
545 550 555 560

Asn Phe Arg Asn Asp Phe Glu Asn Phe Glu Lys Lys Ile Val Tyr Phe
565 570 575

Gly Tyr Met Lys Ser Leu Ile Ala Thr Thr Leu Arg Phe Leu Ser Pro
580 585 590

Gly Val Pro Asp Ile Tyr Gln Gly Thr Glu Val Trp Arg Phe Leu Leu
595 600 605

Thr Asp Pro Asp Asn Arg Met Pro Val Asp Phe Lys Lys Leu Lys Glu
610 615 620

Leu Leu Asn Asn Leu Thr Glu Lys Asn Leu Glu Leu Ser Asp Pro Arg
625 630 635 640

Val Lys Met Leu Tyr Val Lys Lys Leu Leu Gln Leu Arg Arg Glu Tyr
645 650 655

Ser Leu Asn Asp Tyr Lys Pro Leu Pro Phe Gly Phe Gln Arg Gly Lys
660 665 670

Val Ala Val Leu Phe Ser Pro Ile Val Thr Arg Glu Val Lys Glu Lys
675 680 685

Ile Ser Ile Arg Gln Lys Ser Val Asp Trp Ile Arg Asn Glu Glu Ile
690 695 700

Ser Ser Gly Glu Tyr Asn Leu Ser Glu Leu Ile Gly Lys His Lys Val
705 710 715 720

Val Ile Leu Thr Glu Lys Arg Glu
725



Sequence Number : 3
Sequence Length : 3467
Type of Sequence : Nucleic acid
Strandedness : Single
Topology : Linear
Molecule Type : Genomic DNA
Original Source

Organism : Sulfolobus acidocaldarius
Strain : ATCC 33909
Sequence



GCTAATAAAC TGAACAATGA GGACGGAATG AATGAAAATT ATAGCTGGAA TTGTGGAGTA 60
GAAGGAGAAA CTAACGATTC TAATATTCTT TATTGTAGAC AAAAACAAAG AAGAAATTTT 120
GTAATAACAT TATTTGTTAG CCAAGGTATA CCAATGATCT TAGGGGGAGA CGAAATAGGA 180
AGAACACAAA AAGGCAACAA TAATGCTTTT TGTCAGGATA ATGAGACAAG TTGGTATGAT 240
TGGAACCTTG ATGAAAATCG TGTAAGGTTT CATGATTTTG TGAGGAGACT TACCAATTTT 300
TATAAAGCTC ATCCGATATT TAGGAGGGCT AGATATTTTC AGGGTAAGAA GTTACACGGT 360
TCCCCATTAA AGGATGTGAC GTGGCTAAAA CCTGACGGCA ATGAAGTTGA TGATTCAGTG 420
TGGAAATCTC CAACAAATCA TATTATTTAT ATATTAGAGG GAAGTGCTAT CGATGAAATA 480
AATTATAATG GAGAAAGGAT AGCTGACGAC ACTTTTCTAA TTATTTTGAA TGGAGCAAGT 540
ACTAATCTTA AGATAAAAGT ACCTCATGGA AAATGGGAGT TAGTGTTACA TCCTTATCCA 600
CATGAGCCAT CTAACGATAA AAAGATAATA GAAAACAACA AAGAAGTAGA AATAGATGGA 660
AAGACTGCAC TAATTTACAG GAGGATAGAG TTCCAGTGAT ATCAGCAACC TACAGATTAC 720
AGTTAAATAA GAATTTTAAT TTTGGTGACG TAATCGATAA CCTATGGTAT TTTAAGGATT 780
TAGGAGTTTC CCATCTCTAC CTCTCTCCTG TCTTA ATG GCT TCG CCA GGA AGT AAC 836
Met Ala Ser Pro Gly Ser Asn
1 5

CAT GGG TAC GAT GTA ATA GAT CAT TCA AGG ATA AAC GAT GAA CTT GGA 884
His Gly Tyr Asp Val Ile Asp His Ser Arg Ile Asn Asp Glu Leu Gly
10 15 20

GGA GAG AAA GAA TAC AGG AGA TTA ATA GAG ACA GCT CAT ACT ATT GGA 932
Gly Glu Lys Glu Tyr Arg Arg Leu Ile Glu Thr Ala His Thr Ile Gly
25 30 35

TTA GGT ATT ATA CAG GAC ATA GTA CCA AAT CAC ATG GCT GTA AAT TCT 980
Leu Gly Ile Ile Gln Asp Ile Val Pro Asn His Met Ala Val Asn Ser
40 45 50 55

CTA AAT TGG CGA CTA ATG GAT GTA TTA AAA ATG GGT AAA AAG AGT AAA 1028
Leu Asn Trp Arg Leu Met Asp Val Leu Lys Met Gly Lys Lys Ser Lys
60 65 70

TAT TAT ACG TAC TTT GAC TTT TTC CCA GAA GAT GAT AAG ATA CGA TTA 1076
Tyr Tyr Thr Tyr Phe Asp Phe Phe Pro Glu Asp Asp Lys Ile Arg Leu
75 80 85

CCC ATA TTA GGA GAA GAT TTA GAT ACA GTG ATA AGT AAA GGT TTA TTA 1124
Pro Ile Leu Gly Glu Asp Leu Asp Thr Val Ile Ser Lys Gly Leu Leu
90 95 100

AAG ATA GTA AAA GAT GGA GAT GAA TAT TTC CTA GAA TAT TTC AAA TGG 1172
Lys Ile Val Lys Asp Gly Asp Glu Tyr Phe Leu Glu Tyr Phe Lys Trp
105 110 115

AAA CTT CCT CTA ACA GAG GTT GGA AAT GAT ATA TAC GAC ACT TTA CAA 1220
Lys Leu Pro Leu Thr Glu Val Gly Asn Asp Ile Tyr Asp Thr Leu Gln
120 125 130 135

AAA CAG AAT TAT ACC CTA ATG TCT TGG AAA AAT CCT CCT AGC TAT AGA 1268
Lys Gln Asn Tyr Thr Leu Met Ser Trp Lys Asn Pro Pro Ser Tyr Arg
140 145 150

CGA TTC TTC GAT GTT AAT ACT TTA ATA GGA GTA AAT GTC GAA AAA GAT 1316
Arg Phe Phe Asp Val Asn Thr Leu Ile Gly Val Asn Val Glu Lys Asp
155 160 165

CAC GTA TTT CAA GAG TCC CAT TCA AAG ATC TTA GAT TTA GAT GTT GAT 1364
His Val Phe Gln Glu Ser His Ser Lys Ile Leu Asp Leu Asp Val Asp
170 175 180

GGC TAT AGA ATT GAT CAT ATT GAT GGA TTA TAT GAT CCT GAG AAA TAT 1412
Gly Tyr Arg Ile Asp His Ile Asp Gly Leu Tyr Asp Pro Glu Lys Tyr
185 190 195

ATT AAT GAC CTG AGG TCA ATA ATT AAA AAT AAA ATA ATT ATT GTA GAA 1460
Ile Asn Asp Leu Arg Ser Ile Ile Lys Asn Lys Ile Ile Ile Val Glu
200 205 210 215

AAA ATT CTG GGA TTT CAG GAG GAA TTA AAA TTA AAT TCA GAT GGA ACT 1508
Lys Ile Leu Gly Phe Gln Glu Glu Leu Lys Leu Asn Ser Asp Gly Thr
220 225 230

ACA GGA TAT GAC TTC TTA AAT TAC TCC AAC TTA CTG TTT AAT TTT AAT 1556
Thr Gly Tyr Asp Phe Leu Asn Tyr Ser Asn Leu Leu Phe Asn Phe Asn
235 240 245

CAA GAG ATA ATG GAC AGT ATA TAT GAG AAT TTC ACA GCG GAG AAA ATA 1604
Gln Glu Ile Met Asp Ser Ile Tyr Glu Asn Phe Thr Ala Glu Lys Ile
250 255 260

TCT ATA ACT GAA AGT ATA AAG AAA ATA AAA GCG CAA ATA ATT GAT GAG 1652
Ser Ile Ser Glu Ser Ile Lys Lys Ile Lys Ala Gln Ile Ile Asp Glu
265 270 275

CTA TTT AGT TAT GAA GTT AAA AGA TTA GCA TCA CAA CTA GGA ATT AGC 1700
Leu Phe Ser Tyr Glu Val Lys Arg Leu Ala Ser Gln Leu Gly Ile Ser
280 285 290 295

TAC GAT ATA TTG AGA GAT TAC CTT TCT TGT ATA GAT GTG TAC AGA ACT 1748
Tyr Asp Ile Leu Arg Asp Tyr Leu Ser Cys Ile Asp Val Tyr Arg Thr
300 305 310

TAT GCT AAT CAG ATT GTA AAA GAG TGT GAT AAG ACC AAT GAG ATA GAG 1796
Tyr Ala Asn Gln Ile Val Lys Glu Cys Asp Lys Thr Asn Glu Ile Glu
315 320 325

GAA GCA ACC AAA AGA AAT CCA GAG GCT TAT ACT AAA TTA CAA CAA TAT 1844
Glu Ala Thr Lys Arg Asn Pro Glu Ala Tyr Thr Lys Leu Gln Gln Tyr
330 335 340

ATG CCA GCA GTA TAC GCT AAA GCT TAT GAA GAT ACT TTC CTC TTT AGA 1892
Met Pro Ala Val Tyr Ala Lys Ala Tyr Glu Asp Thr Phe Leu Phe Arg
345 350 355

TAC AAT AGA TTA ATA TCC ATA AAT GAG GTT GGA AGC GAT TTA CGA TAT 1940
Tyr Asn Arg Leu Ile Ser Ile Asn Glu Val Gly Ser Asp Leu Arg Tyr
360 365 370 375

TAT AAG ATA TCG CCT GAT CAG TTT CAT GTA TTT AAT CAA AAA CGA AGA 1988
Tyr Lys Ile Ser Pro Asp Gln Phe His Val Phe Asn Gln Lys Arg Arg
380 385 390

GGA AAA ATC ACA CTA AAT GCC ACT AGC ACA CAT GAT ACT AAG TTT AGT 2036
Gly Lys Ile Thr Leu Asn Ala Thr Ser Thr His Asp Thr Lys Phe Ser
395 400 405

GAA GAT GTA AGG ATG AAA ATA ACT GTA TTA ACT GAA TTT CCT GAA GAA 2084
Glu Asp Val Arg Met Lys Ile Ser Val Leu Ser Glu Phe Pro Glu Glu
410 415 420

TGG AAA AAT AAG GTC GAG GAA TGG CAT AGT ATC ATA AAT CCA AAG GTA 2132
Trp Lys Asn Lys Val Glu Glu Trp His Ser Ile Ile Asn Pro Lys Val
425 430 435

TCA AGA AAT GAT GAA TAT AGA TAT TAT CAG GTT TTA GTG GGA AGT TTT 2180
Ser Arg Asn Asp Glu Tyr Arg Tyr Tyr Gln Val Leu Val Gly Ser Phe
440 445 450 455

TAT GAG GGA TTC TCT AAT GAT TTT AAG GAG AGA ATA AAG CAA CAT ATG 2228
Tyr Glu Gly Phe Ser Asn Asp Phe Lys Glu Arg Ile Lys Gln His Met
460 465 470

ATA AAA ACT GTC AGA GAA GCT AAG ATA AAT ACC TCA TGG AGA AAT CAA 2276
Ile Lys Ser Val Arg Glu Ala Lys Ile Asn Thr Ser Trp Arg Asn Gln
475 480 485

AAT AAA GAA TAT GAA AAT AGA GTA ATG GAA TTA GTG GAA GAA ACT TTT 2324
Asn Lys Glu Tyr Glu Asn Arg Val Met Glu Leu Val Glu Glu Thr Phe
490 495 500




ACC AAT AAG GAT TTC ATT AAA AGT TTC ATG AAA TTT GAA ACT AAG ATA 2372
Thr Asn Lys Asp Phe Ile Lys Ser Phe Met Lys Phe Glu Ser Lys Ile
505 510 515

AGA AGG ATA GGG ATG ATT AAG AGC TTA TCC TTG GTC GCA TTA AAA ATT 2420
Arg Arg Ile Gly Met Ile Lys Ser Leu Ser Leu Val Ala Leu Lys Ile
520 525 530 535

ATG TCA GCC GGT ATA CCT GAT TTT TAT CAG GGA ACA GAA ATA TGG CGA 2468
Met Ser Ala Gly Ile Pro Asp Phe Tyr Gln Gly Thr Glu Ile Trp Arg
540 545 550

TAT TTA CTT ACA GAT CCA GAT AAC AGA GTC CCA GTG GAT TTT AAG AAA 2516
Tyr Leu Leu Thr Asp Pro Asp Asn Arg Val Pro Val Asp Phe Lys Lys
555 560 565

TTA CAC GAA ATA TTA GAA AAA TCC AAA AAA TTT GAA AAA AAT ATG TTA 2564
Leu His Glu Ile Leu Glu Lys Ser Lys Lys Phe Glu Lys Asn Met Leu
570 575 580

GAG TCT ATG GAC GAT GGA AGA ATT AAG ATG TAT TTA ACA TAT AAG CTT 2612
Glu Ser Met Asp Asp Gly Arg Ile Lys Met Tyr Leu Thr Tyr Lys Leu
585 590 595

TTA TCC CTA AGA AAA CAG TTG GCT GAG GAT TTT TTA AAG GGC GAG TAT 2660
Leu Ser Leu Arg Lys Gln Leu Ala Glu Asp Phe Leu Lys Gly Glu Tyr
600 605 610 615

AAG GGA TTA GAT CTA GAA GAA GGA CTA TGT GGG TTT ATT AGG TTT AAC 2708
Lys Gly Leu Asp Leu Glu Glu Gly Leu Cys Gly Phe Ile Arg Phe Asn
620 625 630

AAA ATT TTG GTA ATA ATA AAA ACC AAG GGA AGT GTT AAT TAC AAA CTG 2756
Lys Ile Leu Val Ile Ile Lys Thr Lys Gly Ser Val Asn Tyr Lys Leu
635 640 645

AAA CTT GAA GAG GGA GCA ATT TAC ACA GAT GTA TTG ACA GGA GAA GAA 2804
Lys Leu Glu Glu Gly Ala Ile Tyr Thr Asp Val Leu Thr Gly Glu Glu
650 655 660

ATT AAA AAA GAG GTA CAG ATT AAT GAG CTA CCT AGG ATA CTA GTT AGA 2852
Ile Lys Lys Glu Val Gln Ile Asn Glu Leu Pro Arg Ile Leu Val Arg
665 670 675

ATG TAAGTTATAA TAATCCGATT TTTATGTGAC AAGATTTACG CTTACGAAAA 2905
Met
680



GGACTGTTAA ATCAACTTTT ATGTGAATTA TGAAACGTAA ATTATAAGTT TCCTGAGGAT 2965
AAACATATAT ATCTCTATCT CTCATTGATA TCACATGAGT ATTAGATTAA GGGGAAGTAA 3025
TTCTTACGGA CATTCAGGCT GGTTTACAGT ATACTGTAGA ATATGTAATA GGAAAATAAG 3085
AATAGGAACG GACTTAGTCT ACAAATGCCC TAAATGTGAA AAGAAGTATA ACGCATTCTT 3145
CTGTGAAGCA GATGCTAGGG GATTAAAGAA AAAGTGCCCA TACTGTGGTA CTGAACTTGT 3205
CAGTGCAATT TAAGACTCAA ATAGAAGGTA AAAATATTTT TATACTGAAT AATGAGTTGT 3265
TTTACGCTGA TACGGATATA GTTATTCGAA ATCAAGATTT TATTAAGAAA CTCACCTTTA 3325
CACAATATAA TAAGATTGCC TATATTGACA TGGACATAGA AACGACAGAA TTTAAGATAT 3385
TAAGATTAGT AGTGTGTAAA ACTAGAATAA ATATTTATGT TTGCAACGTA ATTGGTAAAT 3445
TGAAAGAAAC TAATTTTGAA AA 3467

Sequence Number : 4 
Sequence Length : 680 
Type of Sequence : Amino acid 
Strandedness : Single 
Topology : Linear 
Molecule Type : Protein 
Original Source 

Organism : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 

Met Ala Ser Pro Gly Ser Asn His Gly Tyr Asp Val Ile Asp His Ser
1 5 10 15

Arg Ile Asn Asp Glu Leu Gly Gly Glu Lys Glu Tyr Arg Arg Leu Ile
20 25 30

Glu Thr Ala His Thr Ile Gly Leu Gly Ile Ile Gln Asp Ile Val Pro
35 40 45

Asn His Met Ala Val Asn Ser Leu Asn Trp Arg Leu Met Asp Val Leu
50 55 60

Lys Met Gly Lys Lys Ser Lys Tyr Tyr Thr Tyr Phe Asp Phe Phe Pro
65 70 75 80

Glu Asp Asp Lys Ile Arg Leu Pro Ile Leu Gly Glu Asp Leu Asp Thr
85 90 95

Val Ile Ser Lys Gly Leu Leu Lys Ile Val Lys Asp Gly Asp Glu Tyr
100 105 110

Phe Leu Glu Tyr Phe Lys Trp Lys Leu Pro Leu Thr Glu Val Gly Asn
115 120 125

Asp Ile Tyr Asp Thr Leu Gln Lys Gln Asn Tyr Thr Leu Met Ser Trp
130 135 140

Lys Asn Pro Pro Ser Tyr Arg Arg Phe Phe Asp Val Asn Thr Leu Ile
145 150 155 160

Gly Val Asn Val Glu Lys Asp His Val Phe Gln Glu Ser His Ser Lys
165 170 175

Ile Leu Asp Leu Asp Val Asp Gly Tyr Arg Ile Asp His Ile Asp Gly
180 185 190

Leu Tyr Asp Pro Glu Lys Tyr Ile Asn Asp Leu Arg Ser Ile Ile Lys
195 200 205

Asn Lys Ile Ile Ile Val Glu Lys Ile Leu Gly Phe Gln Glu Glu Leu
210 215 220

Lys Leu Asn Ser Asp Gly Thr Thr Gly Tyr Asp Phe Leu Asn Tyr Ser
225 230 235 240

Asn Leu Leu Phe Asn Phe Asn Gln Glu Ile Met Asp Ser Ile Tyr Glu
245 250 255

Asn Phe Thr Ala Glu Lys Ile Ser Ile Ser Glu Ser Ile Lys Lys Ile
260 265 270

Lys Ala Gln Ile Ile Asp Glu Leu Phe Ser Tyr Glu Val Lys Arg Leu
275 280 285

Ala Ser Gln Leu Gly Ile Ser Tyr Asp Ile Leu Arg Asp Tyr Leu Ser
290 295 300

Cys Ile Asp Val Tyr Arg Thr Tyr Ala Asn Gln Ile Val Lys Glu Cys
305 310 315 320

Asp Lys Thr Asn Glu Ile Glu Glu Ala Thr Lys Arg Asn Pro Glu Ala
325 330 335

Tyr Thr Lys Leu Gln Gln Tyr Met Pro Ala Val Tyr Ala Lys Ala Tyr
340 345 350

Glu Asp Thr Phe Leu Phe Arg Tyr Asn Arg Leu Ile Ser Ile Asn Glu
355 360 365

Val Gly Ser Asp Leu Arg Tyr Tyr Lys Ile Ser Pro Asp Gln Phe His
370 375 380

Val Phe Asn Gln Lys Arg Arg Gly Lys Ile Thr Leu Asn Ala Thr Ser
385 390 395 400

Thr His Asp Thr Lys Phe Ser Glu Asp Val Arg Met Lys Ile Ser Val
405 410 415

Leu Ser Glu Phe Pro Glu Glu Trp Lys Asn Lys Val Glu Glu Trp His
420 425 430

Ser Ile Ile Asn Pro Lys Val Ser Arg Asn Asp Glu Tyr Arg Tyr Tyr
435 440 445

Gln Val Leu Val Gly Ser Phe Tyr Glu Gly Phe Ser Asn Asp Phe Lys
450 455 460

Glu Arg Ile Lys Gln His Met Ile Lys Ser Val Arg Glu Ala Lys Ile
465 470 475 480

Asn Thr Ser Trp Arg Asn Gln Asn Lys Glu Tyr Glu Asn Arg Val Met
485 490 495

Glu Leu Val Glu Gln Thr Phe Thr Asn Lys Asp Phe Ile Lys Ser Phe
500 505 510

Met Lys Phe Glu Ser Lys Ile Arg Arg Ile Gly Met Ile Lys Ser Leu
515 520 525

Ser Leu Val Ala Leu Lys Ile Met Ser Ala Gly Ile Pro Asp Phe Tyr
530 535 540

Gln Gly Thr Glu Ile Trp Arg Tyr Leu Leu Thr Asp Pro Asp Asn Arg
545 550 555 560

Val Pro Val Asp Phe Lys Lys Leu His Glu Ile Leu Glu Lys Ser Lys
565 570 575

Lys Phe Glu Lys Asn Met Leu Glu Ser Met Asp Asp Gly Arg Ile Lys
580 585 590

Met Tyr Leu Thr Tyr Lys Leu Leu Ser Leu Arg Lys Gln Leu Ala Glu
595 600 605

Asp Phe Leu Lys Gly Glu Tyr Lys Gly Leu Asp Leu Glu Glu Gly Leu
610 615 620

Cys Gly Phe Ile Arg Phe Asn Lys Ile Leu Val Ile Ile Lys Thr Lys
625 630 635 640

Gly Ser Val Asn Tyr Lys Leu Lys Leu Glu Glu Gly Ala Ile Tyr Thr
645 650 655

Asp Val Leu Thr Gly Glu Glu Ile Lys Lys Glu Val Gln Ile Asn Glu
660 665 670

Leu Pro Arg Ile Leu Val Arg Met
675 680

Sequence Number : 5
Sequence Length : 2691
Type of Sequence : Nucleic acid
Strandedness : Single
Topology : Linear
Molecule Type : Genomic DNA
Original Source

Organism : Sulfolobus solfataricus
Strain : KM1
Sequence

CTGCAGTAAC TAGCGCTATC GAAGACGTTA TAAAGAGAAG GATAAATAGA GTTCCAGTGA 60 
GTCTAGAAGA CCTTTTTGAA TAAGGACTTT AATATCATTT AAATTTATTT TTTGCAACAT 120 
GCAGAGGTAA ACCCATGAAT GTCATTTTCG ACGTATTAAA CGAGATCCAT GGGTTTTTTG 180 
GTGCATTGTG GGCGGGAGCA GCTCTACTTA ACTACTTAGT TAAGCCTCAA GATAAGAGGC 240 
AATTTGAGAG AATAGGGAAA TTCTTCATGA TAAACTCAGT CATTACAGTA ATAACTGGGA 300 
TAATAATTTT CGCCTACATT TACCTAGCCC CTTATCAAGG GAATTTATTT CTAGTAGCGG 360 
CAATTCTACG TTCAAGCCTT GACATTAGGT TAAGGGCCTT ACTAAACTTA ATAGGAGGAG 420 
CGTTTGGGTT ATTGGCTTTT GGGGCAGGGA TAGTTATAAG CAATAGGATA AGGCTTATGG 480 
TACGTGTTAA GGAAGGTGAC GCTACAATCC TAGAGTTGAG GAATAGTATT GCCAATTTAT 540 
CTAAAATTAG TTTAATCTTC TTATTACTTT CCTTAGCCAT GATGATACTT GCTGGTTCCA 600 
TAGCACAAGT TATAAGTTAG AGTTGAAAGA AAAATTTA ATG ACG TTT GCT TAT AAA 656 

Met Thr Phe Ala Tyr Lys
0 1 5

ATA GAT GGA AAT GAG GTA ATC TTT ACC TTA TGG GCA CCT TAT CAA AAG 704
Ile Asp Gly Asn Glu Val Ile Phe Thr Leu Trp Ala Pro Tyr Gln Lys
10 15 20

AGC GTT AAA CTA AAG GTT CTA GAG AAG GGA CTT TAC GAA ATG GAA AGA 752
Ser Val Lys Leu Lys Val Leu Glu Lys Gly Leu Tyr Glu Met Glu Arg
25 30 35

GAT GAA AAA GGT TAC TTC ACC ATT ACC TTA AAC AAC GTA AAG GTT AGA 800
Asp Glu Lys Gly Tyr Phe Thr Ile Thr Leu Asn Asn Val Lys Val Arg
40 45 50

GAT AGG TAT AAA TAC GTT TTA GAT GAT GCT AGT GAA ATA CCA GAT CCA 848
Asp Arg Tyr Lys Tyr Val Leu Asp Asp Ala Ser Glu Ile Pro Asp Pro
55 60 65

GCA TCC AGA TAC CAA CCA GAA GGT GTA CAT GGG CCT TCA CAA ATT ATA 896
Ala Ser Arg Tyr Gln Pro Glu Gly Val His Gly Pro Ser Gln Ile Ile
70 75 80 85

CAA GAA AGT AAA GAG TTC AAC AAC GAG ACT TTT CTG AAG AAA GAG GAC 944
Gln Glu Ser Lys Glu Phe Asn Asn Glu Thr Phe Leu Lys Lys Glu Asp
90 95 100

TTG ATA ATT TAT GAA ATA CAC GTG GGG ACT TTC ACT CCA GAG GGA ACG 992
Leu Ile Ile Tyr Glu Ile His Val Gly Thr Phe Thr Pro Glu Gly Thr
105 110 115

TTT GAG GGA GTG ATA AGG AAA CTT GAC TAC TTA AAG GAT TTG GGA ATT 1040
Phe Glu Gly Val Ile Arg Lys Leu Asp Tyr Leu Lys Asp Leu Gly Ile
120 125 130



ACG GCA ATA GAG ATA ATG CCA ATA GCT CAA TTT CCT GGG AAA AGG GAT 1088
Thr Ala Ile Glu Ile Met Pro Ile Ala Gln Phe Pro Gly Lys Arg Asp
135 140 145

TGG GGT TAT GAT GGA GTT TAT TTA TAT GCA GTA CAG AAC TCT TAC GGA 1136
Trp Gly Tyr Asp Gly Val Tyr Leu Tyr Ala Val Gln Asn Ser Tyr Gly
150 155 160 165

GGG CCA GAA GGT TTT AGA AAG TTA GTT GAT GAA GCG CAC AAG AAA GGT 1184
Gly Pro Glu Gly Phe Arg Lys Leu Val Asp Glu Ala His Lys Lys Gly
170 175 180

TTA GGA GTT ATT TTA GAC GTA GTA TAC AAC CAC GTT GGA CCA GAG GGA 1232
Leu Gly Val Ile Leu Asp Val Val Tyr Asn His Val Gly Pro Glu Gly
185 190 195

AAC TAT ATG GTT AAA TTG GGG CCA TAT TTC TCA CAG AAA TAC AAA ACG 1280
Asn Tyr Met Val Lys Leu Gly Pro Tyr Phe Ser Gln Lys Tyr Lys Thr
200 205 210

CCA TGG GGA TTA ACC TTT AAC TTT GAC GAT GCT GAA AGC GAT GAG GTT 1328
Pro Trp Gly Leu Thr Phe Asn Phe Asp Asp Ala Glu Ser Asp Glu Val
215 220 225

AGG AAG TTC ATC TTA GAA AAC GTT GAG TAC TGG ATT AAG GAA TAT AAC 1376
Arg Lys Phe Ile Leu Glu Asn Val Glu Tyr Trp Ile Lys Glu Tyr Asn
230 235 240 245

GTT GAT GGG TTT AGA TTA GAT GCG GTT CAT GCA ATT ATT GAC ACT TCT 1424
Val Asp Gly Phe Arg Leu Asp Ala Val His Ala Ile Ile Asp Thr Ser
250 255 260

CCT AAG CAC ATC TTG GAG GAA ATA GCT GAC GTT GTG CAT AAG TAT AAT 1472
Pro Lys His Ile Leu Glu Glu Ile Ala Asp Val Val His Lys Tyr Asn
265 270 275

AGG ATT GTC ATA GCC GAA AGT GAT TTA AAC GAT CCT AGA GTC GTT AAT 1520
Arg Ile Val Ile Ala Glu Ser Asp Leu Asn Asp Pro Arg Val Val Asn
280 285 290

CCC AAG GAA AAG TGT GGA TAT AAT ATT GAT GCT CAA TGG GTT GAC GAT 1568
Pro Lys Glu Lys Cys Gly Tyr Asn Ile Asp Ala Gln Trp Val Asp Asp
295 300 305

TTC CAT CAT TCT ATT CAC GCT TAC TTA ACT GGT GAG AGG CAA GGC TAT 1616
Phe His His Ser Ile His Ala Tyr Leu Thr Gly Glu Arg Gln Gly Tyr
310 315 320 325

TAT ACG GAT TTC GGT AAC CTT GAC GAT ATA GTT AAA TCG TAT AAG GAC 1664
Tyr Thr Asp Phe Gly Asn Leu Asp Asp Ile Val Lys Ser Tyr Lys Asp
330 335 340

GTT TTC GTA TAT GAT GGT AAG TAC TCC AAT TTT AGA AGA AAA ACT CAC 1712
Val Phe Val Tyr Asp Gly Lys Tyr Ser Asn Phe Arg Arg Lys Thr His
345 350 355

GGA GAA CCA GTT GGT GAA CTA GAC GGA TGC AAT TTC GTA GTT TAT ATA 1760
Gly Glu Pro Val Gly Glu Leu Asp Gly Cys Asn Phe Val Val Tyr Ile
360 365 370

CAA AAT CAC GAT CAA GTC GGA AAT AGA GGC AAA GGT GAA AGA ATA ATT 1808
Gln Asn His Asp Gln Val Gly Asn Arg Gly Lys Gly Glu Arg Ile Ile
375 380 385

AAA TTA GTC GAT AGG GAA AGC TAC AAG ATC GCT GCA GCC CTT TAC CTT 1856
Lys Leu Val Asp Arg Glu Ser Tyr Lys Ile Ala Ala Ala Leu Tyr Leu
390 395 400 405

CTT TCC CCC TAT ATT CCA ATG ATT TTC ATG GGA GAG GAA TAC GGT GAG 1904
Leu Ser Pro Tyr Ile Pro Met Ile Phe Met Gly Glu Glu Tyr Gly Glu
410 415 420

GAA AAT CCC TTT TAT TTC TTT TCT GAT TTT TCA GAT TCA AAA CTG ATA 1952
Glu Asn Pro Phe Tyr Phe Phe Ser Asp Phe Ser Asp Ser Lys Leu Ile
425 430 435

CAA GGT GTA AGG GAA GGG AGA AAA AAG GAA AAC GGG CAA GAT ACT GAC 2000
Gln Gly Val Arg Glu Gly Arg Lys Lys Glu Asn Gly Gln Asp Thr Asp
440 445 450

CCT CAA GAT GAA TCA ACT TTT AAC GCT TCC AAA CTG AGT TGG AAG ATT 2048
Pro Gln Asp Glu Ser Thr Phe Asn Ala Ser Lys Leu Ser Trp Lys Ile
455 460 465

GAC GAG GAA ATC TTT TCA TTT TAC AAG ATT TTA ATA AAA ATG AGA AAG 2096
Asp Glu Glu Ile Phe Ser Phe Tyr Lys Ile Leu Ile Lys Met Arg Lys
470 475 480 485

GAG TTG AGC ATA GCG TGT GAT AGG AGA GTA AAC GTC GTG AAT GGC GAA 2144
Glu Leu Ser Ile Ala Cys Asp Arg Arg Val Asn Val Val Asn Gly Glu
490 495 500

AAT TGG TTG ATC ATC AAG GGA AGA GAA TAC TTT TCA CTC TAC GTT TTC 2192
Asn Trp Leu Ile Ile Lys Gly Arg Glu Tyr Phe Ser Leu Tyr Val Phe
505 510 515

TCT AAA TCA TCT ATT GAA GTT AAG TAC ACT GGA ACT TTA CTT TTG TCC 2240
Ser Lys Ser Ser Ile Glu Val Lys Tyr Ser Gly Thr Leu Leu Leu Ser
520 525 530

TCA AAT AAT TCA TTC CCT CAG CAT ATT GAA GAA GGT AAA TAT GAG TTT 2288
Ser Asn Asn Ser Phe Pro Gln His Ile Glu Glu Gly Lys Tyr Glu Phe
535 540 545

GAT AAG GGA TTT GCT TTA TAT AAA CTT TAGGACA GGAGAGTTTA AAAATTTCTA 2342
Asp Lys Gly Phe Ala Leu Tyr Lys Leu
550 555

TGAATGATTA TACTTTAGAT GATGAGTAAA AGCAAGATCG ATGAGGAAGA GAAAAGGAGA 2402 
AGAGAAGAAG TCAAAAAGTT AGTAATGCTC TTAGCAATGT TAAGATAATG TTTTTTTAAA 2462 
CTCAAATAAT AATAAATACC ATCATGTCAA TATTCTTCAG AACTAGAGAT AGACCTTTAC 2522 
GTCCCGGAGA TCCGTATCCA TTAGGTTCAA ATTGGATAGA AGATGAGGAT GGCGTAAATT 2582 
TTTCCTTGTT CTCAGAGAAT GCAGACAAAG TGGAGTTGAT TCTTTATTCA CAAACAAATC 2642 
AAAAGTATCC AAAGGAGATA ATAGAGGTTA AGAATAGAAC GGGGGATCC 2691 

Sequence Number : 6 
Sequence Length : 558 
Type of Sequence : Amino acid 
Strandedness : Single 
Topology : Linear 
Molecule Type : Protein 
Original Source 

Organism : Sulfolobus solfataricus 
Strain : KM1 
Sequence 

Thr Phe Ala Tyr Lys Ile Asp Gly Asn Glu Val Ile Phe Thr Leu Trp 

1 5 10 15 

Ala Pro Tyr Gln Lys Ser Val Lys Leu Lys Val Leu Glu Lys Gly Leu 

20 25 30 

Tyr Glu Met Glu Arg Asp Glu Lys Gly Tyr Phe Thr Ile Thr Leu Asn 

35 40 45 

Asn Val Lys Val Arg Asp Arg Tyr Lys Tyr Val Leu Asp Asp Ala Ser 

50 55 60 

Glu Ile Pro Asp Pro Ala Ser Arg Tyr Gln Pro Glu Gly Val His Gly 
65 70 75 80 

Pro Ser Gln Ile Ile Gln Glu Ser Lys Glu Phe Asn Asn Glu Thr Phe 

85 90 95 

Leu Lys Lys Glu Asp Leu Ile Ile Tyr Glu Ile His Val Gly Thr Phe 

100 105 110 

Thr Pro Glu Gly Thr Phe Glu Gly Val Ile Arg Lys Leu Asp Tyr Leu 

115 120 125 

Lys Asp Leu Gly Ile Thr Ala Ile Glu Ile Met Pro Ile Ala Gln Phe 

130 135 140 

Pro Gly Lys Arg Asp Trp Gly Tyr Asp Gly Val Tyr Leu Tyr Ala Val 
145 150 155 160 

Gln Asn Ser Tyr Gl? Gly Pro Glu Gly Phe Arg Lys Leu Val Asp Glu 
165 170 175 

Ala His Lys Lys Gly Leu Gly Val Ile Leu Asp Val Val Tyr Asn His 

180 185 190 

Val Gly Pro Glu Gly Asn Tyr Met Val Lys Leu Gly Pro Tyr Phe Ser 

195 200 205 

Gln Lys Tyr Lys Thr Pro Trp Gly Leu Thr Phe Asn Phe Asp Asp Ala 

210 215 220 

Glu Ser Asp Glu Val Arg Lys Phe Ile Leu Glu Asn Val Glu Tyr Trp 
225 230 235 240 

Ile Lys Glu Tyr Asn Val Asp Gly Phe Arg Leu Asp Ala Val His Ala 

245 250 255 

Ile Ile Asp Thr Ser Pro Lys His Ile Leu Glu Glu Ile Ala Asp Val 

260 265 270 

Val His Lys Tyr Asn Arg Ile Val Ile Ala Glu Ser Asp Leu Asn Asp 

275 280 285 

Pro Arg Val Val Asn Pro Lys Glu Lys Cys Gly Tyr Asn Ile Asp Ala 

290 295 300 

Gln Trp Val Asp Asp Phe His His Ser Ile His Ala Tyr Leu Thr Gly 
305 310 315 320 

Glu Arg Gln Gly Tyr Tyr Thr Asp Phe Gly Asn Leu Asp Asp Ile Val 

325 330 335 

Lys Ser Tyr Lys Asp Val Phe Val Tyr Asp Gly Lys Tyr Ser Asn Phe 

340 345 350 

Arg Arg Lys Thr His Gly Glu Pro Val Gly Glu Leu Asp Gly Cys Asn 
355 360 365 



Phe Val Val Tyr Ile Gln Asn His Asp Gln Val Gly Asn Arg Gly Lys 

370 375 380 

Gly Glu Arg Ile Ile Lys Leu Val Asp Arg Glu Ser Tyr Lys Ile Ala 
385 390 395 400 

Ala Ala Leu Tyr Leu Leu Ser Pro Tyr Ile Pro Met Ile Phe Met Gly 

405 410 415 

Glu Glu Tyr Gly Glu Glu Asn Pro Phe Tyr Phe Phe Ser Asp Phe Ser 

420 425 430 

Asp Ser Lys Leu Ile Gln Gly Val Arg Glu Gly Arg Lys Lys Glu Asn 

435 440 445 

Gly Gln Asp Thr Asp Pro Gln Asp Glu Ser Thr Phe Asn Ala Ser Lys 

450 455 460 

Leu Ser Trp Lys Ile Asp Glu Glu Ile Phe Ser Phe Tyr Lys Ile Leu 
465 470 475 480 

Ile Lys Met Arg Lys Glu Leu Ser Ile Ala Cys Asp Arg Arg Val Asn 

485 490 495 

Val Val Asn Gly Glu Asn Trp Leu Ile Ile Lys Gly Arg Glu Tyr Phe 

500 505 510 

Ser Leu Tyr Val Phe Ser Lys Ser Ser Ile Glu Val Lys Tyr Ser Gly 

515 520 525 

Thr Leu Leu Leu Ser Ser Asn Asn Ser Phe Pro Gln His Ile Glu Glu 

530 535 540 

Gly Lys Tyr Glu Phe Asp Lys Gly Phe Ala Leu Tyr Lys Leu 

545 550 555 

Sequence Number : 7 

Sequence Length : 3600 

Type of Sequence : Nucleic acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Genomic DNA 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 



Sequence 



1 TTPPTTTTP 

ATTCGTTTTO 


t pTf 1 k P T P P P 

AGTCACTCGG 


PPT A P P T P T P~ 

CGTAGGTCTG 


i AG! 1 1 1 1 1 1 


ACATAATGCT 


TGCCAAGAAT 


CGAAGAAGGC 


GTCCTGCCCT 


GCACTAACTC 


CGAGCTCCGC 


GAGTTTAGTA 


GTCACGAATT 


ATCCCTTTCT 


CATGCAATAA 


ATTCTTCGCG 


TAGTTGTACG 


GACGAAATGT 


GAAAGACATA 


GAACACTTTC 


TTTGGCCCTC 


ACGTAGAAGC 


CGTCCTCTTT 


CACGTTGTTC 


TTCTCGTCAT 


GCCTCCCTAA 


GCCTTATACC 


GCTCTCAAGG 


AGGAGCTTGA 


CCTCTAACAG 


CCTCCAACCA 


CCTCCCTATC 


TCGTCAGCTC 


CCAGACTTTT 


TCGTTTTCAG 


CTTTTTCCAT 


GCCTCAAGAT 


TTCTTCCAGG 


CTAGGATAGA 


GTTCTTAGCA 


TTACTAGGGG 


TACTGCCTGC 


AAGTTTCCTC 


ACTGGCCATT 


TTCAAACAAT 


TCCTTTTCCG 


TGAGACCATT 


TTTGCCCTCC 


CTAGAAGTAA 


TTACTCTCTT 


CATCATTTGA 


AAGAGGCGTT 


TTAGGGGATT 


GGCCCCTGGG 


ACCAGGGTTC 


GAGTCCCTGC 


CCGGCTACCT 


ACCCTAATAC 


CCCACTTCTA 


TCTTACAATT 


TCAGGTAAGT 



TGGCGAGGGC TAATAAGTTG 60 

GCATGAAATC GATTACCTCG 120 

TGCGTACATA TTTCGGCGCT 180 

TTATATCAGT CTTAGCTATA 240 

TAGTCCAGTT GAGCGTGTAT 300 

ACTCATTGAG AACCTTTACA 360 

AGACTAGCT(f TACCTCAATA 420 

CTGGAACCTT AAGATCAACA 480 

CCCCTTTCCA CTTGTAGAAC 540 

GCTTCTTCAG ATAATTGATA 600 

ATTCATAAAA TTCAATTAAT 660 

GGGAGTTTAG GGCAAATCCC 720 

CCTCCCCTAA CCAGGGCTTT 780 

TTGAAAGGTT AGGGGGATAC 840 

CTTTACTAGG TCAACTAAAG 900 






CACCAACGTA AGTCTCCTTC GTCTTACCAC CTTGACTCTT CTTGATAAAG TAAACATAAT 960 

ATCATCCATA GACTTACCTT ATTCTTATAT TACCATATGA TTTTATTATT TTGTATTTCT 1020 

ATTAGATAAG TCCCACTCAT AGAACAAATG ATGGTTTTAA CTTATATACT AAATACTCTA 1080 

ATAACTCAAC AATAATAAGA ATTTAATCAG TTCTGATAAG TATTTTCACT CGAAAACATT 1140 
TAAATATATT AAGACATAAT TTCTATTTAA ACAGC ATG TTT TCG TTC GGT GGA AAT 1196 

Met Phe Ser Phe Gly Gly Asn 

1 5 

ATT GAA AAA AAT AAA GGT ATC TTT AAG TTA TGG GCA CCT TAT GTT AAT 1244 

Ile Glu Lys Asn Lys Gly Ile Phe Lys Leu Trp Ala Pro Tyr Val Asn 

10 15 20 

AGT GTT AAG CTG AAG TTA AGC AAA AAA CTT ATT CCA ATG GAA AAA AAC 1292 

Ser Val Lys Leu Lys Leu Ser Lys Lys Leu Ile Pro Met Glu Lys Asn 

25 30 35 

GAT GAG GGA TTT TTC GAA GTA GAA ATA GAC GAT ATC GAG GAA AAT TTA 1340 

Asp Glu Gly Phe Phe Glu Val Glu Ile Asp Asp Ile Glu Glu Asn Leu 

40 45 50 55 

ACC TAT TCT TAT ATT ATA GAA GAT AAG AGA GAG ATA CCT GAT CCC GCA 1388 

Thr Tyr Ser Tyr Ile Ile Glu Asp Lys Arg Glu Ile Pro Asp Pro Ala 

60 65 70 

TCA CGA TAT CAA CCT TTA GGA GTT CAT GAC AAA TCA CAA CTT ATA AGA 1436 

Ser Arg Tyr Gln Pro Leu Gly Val His Asp Lys Ser Gln Leu Ile Arg 

75 80 85 
ACA GAT TAT CAG ATT CTT GAC CTT GGA AAA GTA AAA ATA GAA GAT CTA 1484 
Thr Asp Tyr Gln Ile Leu Asp Leu Gly Lys Val Lys Ile Glu Asp Leu 

90 95 100 

ATA ATA TAT GAA CTC CAC GTT GGT ACT TTT TCC CAA GAA GGA AAT TTC 1532 
Ile Ile Tyr Glu Leu His Val Gly Thr Phe Ser Gln Glu Gly Asn Phe 

105 110 115 

AAA GGA GTA ATA GAA AAG TTA GAT TAC CTC AAG GAT CTA GGA ATC ACA 1580 
Lys Gly Val Ile Glu Lys Leu Asp Tyr Leu Lys Asp Leu Gly Ile Thr 
120 125 130 135 

GGA ATT GAA CTG ATG CCT GTG GCA CAA TTT CCA GGG AAT AGA GAT TGG 1628 
Gly Ile Glu Leu Met Pro Val Ala Gln Phe Pro Gly Asn Arg Asp Trp 

140 145 150 

GGA TAC GAT GGT GTT TTT CTA TAC GCA GTT CAA AAT ACT TAT GGC GGA 1676 
Gly Tyr Asp Gly Val Phe Leu Tyr Ala Val Gln Asn Thr Tyr Gly Gly 

155 160 165 

CCA TGG GAA TTG GCT AAG CTA GTA AAC GAG GCA CAT AAA AGG GGA ATA 1724 
Pro Trp Glu Leu Ala Lys Leu Val Asn Glu Ala His Lys Arg Gly Ile 

170 175 180 

GCC GTA ATT TTG GAT GTT GTA TAT AAT CAT ATA GGT CCT GAG GGA AAT 1772 
Ala Val Ile Leu Asp Val Val Tyr Asn His Ile Gly Pro Glu Gly Asn 

185 190 195 

TAC CTT TTA GGA TTA GGT CCT TAT TTT TCA GAC AGA TAT AAA ACT CCA 1820 
Tyr Leu Leu Gly Leu Gly Pro Tyr Phe Ser Asp Arg Tyr Lys Thr Pro 
200 205 210 215 

TGG GGA TTA ACA TTT AAT TTT GAT GAT AGG GGA TGT GAT CAA GTT AGA 1868 
Trp Gly Leu Thr Phe Asn Phe Asp Asp Arg Gly Cys Asp Gln Val Arg 

220 225 230 

AAA TTC ATT TTA GAA AAT GTC GAG TAT TGG TTT AAG ACC TTT AAA ATC 1916 
Lys Phe Ile Leu Glu Asn Val Glu Tyr Trp Phe Lys Thr Phe Lys Ile 

235 240 245 

GAT GGT CTG AGA CTG GAT GCA GTT CAT GCA ATT TTT GAT AAT TCG CCT 1964 
Asp Gly Leu Arg Leu Asp Ala Val His Ala Ile Phe Asp Asn Ser Pro 

250 255 260 

AAG CAT ATC CTC CAA GAG ATA GCT GAA AAA GCC CAT CAA TTA GGA AAA 2012 
Lys His Ile Leu Gln Glu Ile Ala Glu Lys Ala His Gln Leu Gly Lys 

265 270 275 

TTT GTT ATT GCT GAA ACT GAT TTA AAT GAT CCA AAA ATA GTA AAA GAT 2060 
Phe Val Ile Ala Glu Ser Asp Leu Asn Asp Pro Lys Ile Val Lys Asp 
280 285 290 295 

GAT TGT GGA TAT AAA ATA GAT GCT CAA TGG GTT GAC GAT TTC CAC CAC 2108 
Asp Cys Gly Tyr Lys Ile Asp Ala Gln Trp Val Asp Asp Phe His His 

300 305 310 

GCA GTT CAT GCA TTC ATA ACA AAA GAA AAA GAT TAT TAT TAC CAG GAT 2156 
Ala Val His Ala Phe Ile Thr Lys Glu Lys Asp Tyr Tyr Tyr Gln Asp 

315 320 325 

TTT GGA AGG ATA GAA GAT ATA GAG AAA ACT TTT AAA GAT GTT TTT GTT 2204 
Phe Gly Arg Ile Glu Asp Ile Glu Lys Thr Phe Lys Asp Val Phe Val 
330 335 340 

TAT GAT GGA AAG TAT TCT AGA TAC AGA GGA AGA ACT CAT GGT GCT CCT 2252 
Tyr Asp Gly Lys Tyr Ser Arg Tyr Arg Gly Arg Thr His Gly Ala Pro 

345 350 355 

GTA GGT GAT CTT CCA CCA CGT AAA TTT GTA GTC TTC ATA CAA AAT CAC 2300 
Val Gly Asp Leu Pro Pro Arg Lys Phe Val Val Phe Ile Gln Asn His 
360 365 370 375 

GAT CAA GTA GGA AAT AGA GGA AAT GGG GAA AGA CTT TCC ATA TTA ACC 2348 
Asp Gln Val Gly Asn Arg Gly Asn Gly Glu Arg Leu Ser Ile Leu Thr 

380 385 390 

GAT AAA ACG ACA TAC CTT ATG GCA GCC ACA CTA TAT ATA CTC TCA CCG 2396 
Asp Lys Thr Thr Tyr Leu Met Ala Ala Thr Leu Tyr Ile Leu Ser Pro 

395 400 405 

TAT ATA CCG CTA ATA TTT ATG GGC GAG GAA TAT TAT GAG ACG AAT CCT 2444 
Tyr Ile Pro Leu Ile Phe Met Gly Glu Glu Tyr Tyr Glu Thr Asn Pro 

410 415 420 

TTT TTC TTC TTC TCT GAT TTC TCA GAT CCC GTA TTA ATT AAG GGT GTT 2492 
Phe Phe Phe Phe Ser Asp Phe Ser Asp Pro Val Leu Ile Lys Gly Val 

425 430 435 

AGA GAA GGT AGA CTA AAG GAA AAT AAT CAA ATG ATA GAT CCA CAA TCT 2540 
Arg Glu Gly Arg Leu Lys Glu Asn Asn Gln Met Ile Asp Pro Gln Ser 
440 445 450 455 

GAG GAA GCG TTC TTA AAG ACT AAA CTT TCA TGG AAA ATT GAT GAG GAA 2588 
Glu Glu Ala Phe Leu Lys Ser Lys Leu Ser Trp Lys Ile Asp Glu Glu 

460 465 470 

GTT TTA GAT TAT TAT AAA CAA CTG ATA AAT ATC AGA AAG AGA TAT AAT 2636 
Val Leu Asp Tyr Tyr Lys Gln Leu Ile Asn Ile Arg Lys Arg Tyr Asn 

475 480 485 

AAT TGT AAA AGG GTA AAG GAA GTT AGG AGA GAA GGG AAC TGT ATT ACT 2684 
Asn Cys Lys Arg Val Lys Glu Val Arg Arg Glu Gly Asn Cys Ile Thr 

490 495 500 

TTG ATC ATG GAA AAA ATA GGA ATA ATT GCA TCG TTT GAT GAT ATT GTA 2732 
Leu Ile Met Glu Lys Ile Gly Ile Ile Ala Ser Phe Asp Asp Ile Val 

505 510 515 

ATT AAT TCT AAA ATT ACA GGT AAT TTA CTT ATA GGC ATA GGA TTT CCG 2780 
Ile Asn Ser Lys Ile Thr Gly Asn Leu Leu Ile Gly Ile Gly Phe Pro 
520 525 530 535 

AAA AAA TTG AAA AAA GAT GAA TTA ATT AAG GTT AAC AGA GGT GTT GGG 2828 
Lys Lys Leu Lys Lys Asp Glu Leu Ile Lys Val Asn Arg Gly Val Gly 

540 545 550 

GTA TAT CAA TTA GAA TGAAAGATCG ACCATTAAAG CCTGGTGAAC CTTATCCTTT 2883 
Val Tyr Gln Leu Glu 
555 

AGGGGCAACT TGGATAGAGG AAGAAGATGG AGTTAATTTT GTACTATTCT CTGAGAACGC 2943 
CACAAAAGTA GAACTGTTAA CGTACTCTCA GACTAGACAA GATGAGCCAA AGGAAATAAT 3003 
AGAACTTAGA CAGAGAACCG GAGATCTCTG GCATGTTTTT GTACCTGGTT TAAGACCAGG 3063 
TCAGTTGTAT GGGTACAGGG TGTATGGTCC ATATAAACCA GAGGAAGGGT TAAGGTTTAA 3123 
TCCTAATAAA GTACTGATAG ATCCTTATGC AAAAGCTATA AACGGATTAT TACTATGGGA 3183 
TGATTCGGTC TTTGGATATA AAATTGGAGA TCAGAACCAG GATCTCAGTT TCGATGAGAG 3243 
AAAAGACGAT AAATTTATAC CTAAAGGGGT CATAATAAAT CCTTATTTTG ATTGGGAGGA 3303 

CGAGCATTTC TTCTTTAGAA GAAAGATACC TTTTAAGGAT AGTATAATTT ATGAGACACA 3363 

TATAAAAGGA ATAACTAAAT TAAGGCAAGA TTTACCGGAG AACGTTAGAG GCACTTTTTT 3423 

GGGTTTAGCA TCAGATACTA TGATTGATTA CCTAAAAGAT TTAGGAATTA CAACCGTTGA 3483 

GATAATGCCT ATTCAGCAAT TTGTAGATGA GAGATTCATT GTCGATAAAG GGTTAAAGAA 3543 

CTACTGGGGT TACAATCCGA TAAATTATTT CTCTCCTGAA TGTAGATACT CAAGCTC 3600 

Sequence Number : 8 
Sequence Length : 556 
Type of Sequence : Amino acid 
Strandedness : Single 
Topology : Linear 
Molecule Type : Protein 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 

Met Phe Ser Phe Gly Gly Asn Ile Glu Lys Asn Lys Gly Ile Phe Lys 
1 5 10 15 

Leu Trp Ala Pro Tyr Val Asn Ser Val Lys Leu Lys Leu Ser Lys Lys 

20 25 30 

Leu Ile Pro Met Glu Lys Asn Asp Glu Gly Phe Phe Glu Val Glu Ile 

35 40 45 

Asp Asp Ile Glu Glu Asn Leu Thr Tyr Ser Tyr Ile Ile Glu Asp Lys 
50 55 60 

Arg Glu Ile Pro Asp Pro Ala Ser Arg Tyr Gln Pro Leu Gly Val His 
65 70 75 80 

Asp Lys Ser Gln Leu Ile Arg Thr Asp Tyr Gln Ile Leu Asp Leu Gly 

85 90 95 

Lys Val Lys Ile Glu Asp Leu Ile Ile Tyr Glu Leu His Val Gly Thr 

100 105 110 

Phe Ser Gln Glu Gly Asn Phe Lys Gly Val Ile Glu Lys Leu Asp Tyr 

115 120 125 

Leu Lys Asp Leu Gly Ile Thr Gly Ile Glu Leu Met Pro Val Ala Gln 

130 135 140 

Phe Pro Gly Asn Arg Asp Trp Gly Tyr Asp Gly Val Phe Leu Tyr Ala 
145 150 155 160 

Val Gln Asn Thr Tyr Gly Gly Pro Trp Glu Leu Ala Lys Leu Val Asn 

165 170 175 

Glu Ala His Lys Arg Gly Ile Ala Val Ile Leu Asp Val Val Tyr Asn 

180 185 190 

His Ile Gly Pro Glu Gly Asn Tyr Leu Leu Gly Leu Gly Pro Tyr Phe 

195 200 205 

Ser Asp Arg Tyr Lys Thr Pro Trp Gly Leu Thr Phe Asn Phe Asp Asp 

210 215 220 

Arg Gly Cys Asp Gln Val Arg Lys Phe Ile Leu Glu Asn Val Glu Tyr 
225 230 235 240 

Trp Phe Lys Thr Phe Lys Ile Asp Gly Leu Arg Leu Asp Ala Val His 
245 250 255 

Ala Ile Phe Asp Asn Ser Pro Lys His Ile Leu Gln Glu Ile Ala Glu 

260 265 270 

Lys Ala His Gln Leu Gly Lys Phe Val Ile Ala Glu Ser Asp Leu Asn 

275 280 285 

Asp Pro Lys Ile Val Lys Asp Asp Cys Gly Tyr Lys Ile Asp Ala Gln 

290 295 300 

Trp Val Asp Asp Phe His His Ala Val His Ala Phe Ile Thr Lys Glu 
305 310 315 320 

Lys Asp Tyr Tyr Tyr Gln Asp Phe Gly Arg Ile Glu Asp Ile Glu Lys 

325 330 335 

Thr Phe Lys Asp Val Phe Val Tyr Asp Gly Lys Tyr Ser Arg Tyr Arg 

340 345 350 

Gly Arg Thr His Gly Ala Pro Val Gly Asp Leu Pro Pro Arg Lys Phe 

355 360 365 

Val Val Phe Ile Gln Asn His Asp Gln Val Gly Asn Arg Gly Asn Gly 

370 375 380 

Glu Arg Leu Ser Ile Leu Thr Asp Lys Thr Thr Tyr Leu Met Ala Ala 
385 390 395 400 

Thr Leu Tyr Ile Leu Ser Pro Tyr Ile Pro Leu Ile Phe Met Gly Glu 

405 410 415 

Glu Tyr Tyr Glu Thr Asn Pro Phe Phe Phe Phe Ser Asp Phe Ser Asp 

420 425 430 

Pro Val Leu Ile Lys Gly Val Arg Glu Gly Arg Leu Lys Glu Asn Asn 
435 440 445 
Gln Met Ile Asp Pro Gln Ser Glu Glu Ala Phe Leu Lys Ser Lys Leu 

450 455 460 

Ser Trp Lys Ile Asp Glu Glu Val Leu Asp Tyr Tyr Lys Gln Leu Ile 
465 470 475 480 

Asn Ile Arg Lys Arg Tyr Asn Asn Cys Lys Arg Val Lys Glu Val Arg 

485 490 495 

Arg Glu Gly Asn Cys Ile Thr Leu Ile Met Glu Lys Ile Gly Ile Ile 

500 505 510 

Ala Ser Phe Asp Asp Ile Val Ile Asn Ser Lys Ile Thr Gly Asn Leu 

515 520 525 

Leu Ile Gly Ile Gly Phe Pro Lys Lys Leu Lys Lys Asp Glu Leu Ile 

530 535 540 

Lys Val Asn Arg Gly Val Gly Val Tyr Gln Leu Glu 

545 550 555 



Sequence Number : 9 

Sequence Length : 6 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source 

Organism : Sulfolobus solfataricus 



Strain : KM1 
Sequence 

Val Ile Arg Glu Ala Lys 
1 5 



Sequence Number : 10 

Sequence Length : 6 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Ile Ser Ile Arg Gln Lys 
1 5 



Sequence Number : 11 
Sequence Length : 5 
Type of Sequence : Amino acid 
Strandedness : Single 
Topology : Linear 
Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Ile Ile Tyr Val Glu 
1 5 

Sequence Number : 12 

Sequence Length : 5 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Met Leu Tyr Val Lys 
1 5 

Sequence Number : 13 

Sequence Length : 7 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Ile Leu Ser Ile Asn Glu Lys 
1 5 



Sequence Number : 14 

Sequence Length : 7 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Val Val Ile Leu Thr Glu Lys 
1 5 



Sequence Number : 15 

Sequence Length : 10 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Asn Leu Glu Leu Ser Asp Pro Arg Val Lys 
1 5 10 



Sequence Number : 16 

Sequence Length : 12 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 
Type of Fragment : Internal fragment 
Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Met Ile Ile Gly Thr Tyr Arg Leu Gln Leu Asn Lys 
1 5 10 

Sequence Number : 17 
Sequence Length : 9 
Type of Sequence : Amino acid 
Strandedness : Single 
Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source 

Organism : Sulfolobus solfataricus 
Strain : KM1 
Sequence 

Val Ala Val Leu Phe Ser Pro Ile Val 
1 5 9 

Sequence Number : 18 

Sequence Length : 11 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Ile Asn Ile Asp Glu Leu Ile Ile Gln Ser Lys 
1 5 10 

Sequence Number : 19 

Sequence Length : 12 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Glu Leu Gly Val Ser His Leu Tyr Leu Ser Pro Ile 
1 5 10 

Sequence Number : 20 

Sequence Length : 7 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Asp Glu Val Phe Arg Glu Ser 
1 5 

Sequence Number : 21 

Sequence Length : 4 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 
Type of Fragment : Internal fragment 
Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 
Asp Tyr Phe Lys 
1 

Sequence Number : 22 

Sequence Length : 7 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Asp Gly Leu Tyr Asn Pro Lys 
1 5 

Sequence Number : 23 

Sequence Length : 8 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Asp Ile Asn Gly Ile Arg Glu Cys 
1 5 



Sequence Number : 24 

Sequence Length : 7 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Asp Phe Glu Asn Phe Glu Lys 
1 5 

Sequence Number : 25 

Sequence Length : 7 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Asp Leu Leu Arg Pro Asn Ile 
1 5 

Sequence Number : 26 

Sequence Length : 5 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 
Type of Fragment : Internal fragment 
Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Asp Ile Ile Glu Asn 
1 5 



Sequence Number : 27 

Sequence Length : 7 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Asp Asn Ile Glu Tyr Arg Gly 
1 5 

Sequence Number : 28 
Sequence Length : 18 
Type of Sequence : Nucleic acid 
Strandedness : Single 
Topology : Linear 

Molecule Type : Other nucleic acid (Synthesized DNA) 
Sequence 

YTCWCKRAAW ACYTCATC 

Sequence Number : 29 
Sequence Length : 20 
Type of Sequence : Nucleic acid 
Strandedness : Single 
Topology : Linear 

Molecule Type : Other nucleic acid (Synthesized DNA) 
Sequence 

GATAAYATWG ARTAYAGRGG 

Sequence Number : 30 

Sequence Length : 8 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 
Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 
Strain : ATCC 33909 

Sequence 

Arg Asn Pro Glu Ala Tyr Thr Lys 
1 5 

Sequence Number : 31 

Sequence Length : 9 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 

Asp His Val Phe Gln Glu Ser His Ser 
1 5 

Sequence Number : 32 
Sequence Length : 8 
Type of Sequence : Amino acid 
Strandedness : Single 
Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 

Ile Thr Leu Asn Ala Thr Ser Thr 
1 5 

Sequence Number : 33 

Sequence Length : 6 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 

Ile Ile Ile Val Glu Lys 
1 5 

Sequence Number : 34 

Sequence Length : 11 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 

Leu Gln Gln Tyr Met Pro Ala Val Tyr Ala Lys 
1 5 10 

Sequence Number : 35 

Sequence Length : 5 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 



Asn Met Leu Glu 
1 5 

Sequence Number : 36 

Sequence Length : 13 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 

Lys Ile Ser Pro Asp Gln Phe His Val Phe Asn Gln Lys 
1 5 10 

Sequence Number : 37 

Sequence Length : 8 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 
Strain : ATCC 33909 
Sequence 

Gln Leu Ala Glu Asp Phe Leu Lys 
1 5 

Sequence Number : 38 

Sequence Length : 10 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 

Lys Ile Leu Gly Phe Gln Glu Glu Leu Lys 
1 5 10 

Sequence Number : 39 

Sequence Length : 10 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 
Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 
Strain : ATCC 33909 

Sequence 

Ile Ser Val Leu Ser Glu Phe Pro Glu Glu 
1 5 10 

Sequence Number : 40 

Sequence Length : 9 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 

Leu Lys Leu Glu Glu Gly Ala Ile Tyr 
1 5 

Sequence Number : 41 
Sequence Length : 8 
Type of Sequence : Amino acid 
Strandedness : Single 
Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 

Glu Val Gln Ile Asn Glu Leu Pro 
1 5 

Sequence Number : 42 

Sequence Length : 5 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 

Asp His Ser Arg Ile 
1 5 

Sequence Number : 43 

Sequence Length : 6 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 

Asp Leu Arg Tyr Tyr Lys 
1 5 

Sequence Number : 44 

Sequence Length : 14 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 
Asp Val Tyr Arg Thr Tyr Ala Asn Gln Ile Val Lys Glu Cys 
1 5 10 

Sequence Number : 45 

Sequence Length : 10 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : N-terminal fragment 

Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Thr Phe Ala Tyr Lys Ile Asp Gly Asn Glu 
1 5 10 

Sequence Number : 46 

Sequence Length : 7 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 
Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Leu Gly Pro Tyr Phe Ser Gin 
1 5 

Sequence Number : 47 

Sequence Length : 7 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Asp Val Phe Val Tyr Asp Gly 

1 5 

Sequence Number : 48 

Sequence Length : 19 

Type of Sequence : Amino acid 




Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source 

Organism : Sulfolobus solfataricus 

Strain : KM1 
Sequence 

Tyr Asn Arg Ile Val Ile Ala Glu Ser Asp Leu Asn Asp Pro Arg Val 

1 5 10 15 

Val Asn Pro 



Sequence Number : 49 

Sequence Length : 5 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 

Leu Asp Tyr Leu Lys 
1 5 

Sequence Number : 50 

Sequence Length : 17 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 

Lys Arg Glu Ile Pro Asp Pro Ala Ser Arg Tyr Gln Pro Leu Gly Val 

1 5 10 15 

His 

17 

Sequence Number : 51 

Sequence Length : 9 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 

Lys Asp Val Phe Val Tyr Asp Gly Lys 
1 5 

Sequence Number : 52 

Sequence Length : 9 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 

His Ile Leu Gln Glu Ile Ala Glu Lys 
1 5 

Sequence Number : 53 

Sequence Length : 10 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 
Original Source : Sulfolobus acidocaldarius 
Strain : ATCC 33909 
Sequence 

Lys Leu Trp Ala Pro Tyr Val Asn Ser Val 
1 5 10 

Sequence Number : 54 

Sequence Length : 7 

Type of Sequence : Amino acid 

Strandedness : Single 

Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 

Met Phe Ser Phe Gly Gly Asn 
1 5 

Sequence Number : 55 
Sequence Length : 14 
Type of Sequence : Amino acid 
Strandedness : Single 
Topology : Linear 
Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 

Asp Tyr Tyr Tyr Gln Asp Phe Gly Arg Ile Glu Asp Ile Glu 
1 5 10 

Sequence Number : 56 
Sequence Length : 7 
Type of Sequence : Amino acid 
Strandedness : Single 
Topology : Linear 

Molecule Type : Peptide 

Type of Fragment : Internal fragment 

Original Source : Sulfolobus acidocaldarius 

Strain : ATCC 33909 
Sequence 

Lys Ile Asp Ala Gln Trp Val 
1 5 

Sequence Number : 57 

Sequence Length : 18 

Type of Sequence : Nucleic acid 
Strandedness : Single 
Topology : Linear 

Molecule Type : Other nucleic acid (Synthesized DNA) 
Sequence 

AGCWAGKAGM TAYCARCC 

Sequence Number : 58 
Sequence Length : 24 
Type of Sequence : Nucleic acid 
Strandedness : Single 
Topology : Linear 

Molecule Type : Other nucleic acid (Synthesized DNA) 
Sequence 

YTTHCCATCR TAWACRAAWA CATC 
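
Note on the synthesized DNA sequences (Sequence Numbers 28, 29, 57 and 58): these sequences contain letters other than A, C, G and T. The listing does not define those letters. The sketch below assumes they are the standard IUPAC nucleotide ambiguity codes (R = A/G, Y = C/T, W = A/T, K = G/T, M = A/C, H = A/C/T), so that each printed sequence stands for a mixture of oligonucleotides. The function names `clean`, `degeneracy` and `expand` and the `PRIMERS` table are illustrative only and are not part of the listing.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes (assumed to apply to this listing).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "W": "AT", "S": "CG",
    "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}

# Synthesized DNA sequences copied as printed in the listing.
PRIMERS = {
    28: "YTCWCKRAAW ACYTCATC",
    29: "GATAAYATWG ARTAYAGRGG",
    57: "AGCWAGKAGM TAYCARCC",
    58: "YTTHCCATCR TAWACRAAWA CATC",
}


def clean(primer):
    """Remove the spacing between 10-base groups and upper-case the letters."""
    return "".join(primer.split()).upper()


def degeneracy(primer):
    """Number of distinct A/C/G/T sequences represented by a degenerate primer."""
    count = 1
    for base in clean(primer):
        count *= len(IUPAC[base])
    return count


def expand(primer):
    """List every concrete A/C/G/T sequence represented by a degenerate primer."""
    pools = [IUPAC[base] for base in clean(primer)]
    return ["".join(combo) for combo in product(*pools)]


if __name__ == "__main__":
    for number, sequence in PRIMERS.items():
        print(number, len(clean(sequence)), degeneracy(sequence))
```

Under the stated assumption, and taking the letters exactly as printed above, the four sequences have lengths 18, 20, 18 and 24 and represent 64, 32, 32 and 96 distinct oligonucleotides respectively.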



